Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 4372 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 478.3 KiB |
| Average record size in memory | 112.0 B |
Variable types
| NUM | 14 |
|---|
201103 is highly correlated with 201012 and 7 other fields | High correlation |
201012 is highly correlated with 201103 and 4 other fields | High correlation |
201101 is highly correlated with 201103 and 1 other fields | High correlation |
201102 is highly correlated with 201103 and 1 other fields | High correlation |
201104 is highly correlated with 201012 and 4 other fields | High correlation |
201105 is highly correlated with 201102 and 2 other fields | High correlation |
201106 is highly correlated with 201105 and 1 other fields | High correlation |
201107 is highly correlated with 201012 and 4 other fields | High correlation |
201110 is highly correlated with 201106 | High correlation |
201111 is highly correlated with 201012 and 5 other fields | High correlation |
201112 is highly correlated with 201012 and 4 other fields | High correlation |
201012 is highly skewed (γ1 = 61.08787263) | Skewed |
201101 is highly skewed (γ1 = 45.79007171) | Skewed |
201102 is highly skewed (γ1 = 44.60023013) | Skewed |
201103 is highly skewed (γ1 = 52.70101885) | Skewed |
201104 is highly skewed (γ1 = 55.3589938) | Skewed |
201105 is highly skewed (γ1 = 41.76877766) | Skewed |
201106 is highly skewed (γ1 = 39.09144293) | Skewed |
201107 is highly skewed (γ1 = 51.71521442) | Skewed |
201108 is highly skewed (γ1 = 35.03313489) | Skewed |
201109 is highly skewed (γ1 = 33.35022294) | Skewed |
201110 is highly skewed (γ1 = 36.05642898) | Skewed |
201111 is highly skewed (γ1 = 62.12608601) | Skewed |
201112 is highly skewed (γ1 = 56.96227946) | Skewed |
CustomerID has unique values | Unique |
201012 has 3423 (78.3%) zeros | Zeros |
201101 has 3591 (82.1%) zeros | Zeros |
201102 has 3574 (81.7%) zeros | Zeros |
201103 has 3354 (76.7%) zeros | Zeros |
201104 has 3474 (79.5%) zeros | Zeros |
201105 has 3294 (75.3%) zeros | Zeros |
201106 has 3323 (76.0%) zeros | Zeros |
201107 has 3381 (77.3%) zeros | Zeros |
201108 has 3391 (77.6%) zeros | Zeros |
201109 has 3073 (70.3%) zeros | Zeros |
201110 has 2952 (67.5%) zeros | Zeros |
201111 has 2666 (61.0%) zeros | Zeros |
201112 has 3689 (84.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-02 07:44:58.314098 |
|---|---|
| Analysis finished | 2022-11-02 07:45:18.653955 |
| Duration | 20.34 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 4372 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15299.67772 |
|---|---|
| Minimum | 12346 |
| Maximum | 18287 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 12346 |
|---|---|
| 5-th percentile | 12613.55 |
| Q1 | 13812.75 |
| median | 15300.5 |
| Q3 | 16778.25 |
| 95-th percentile | 17984.45 |
| Maximum | 18287 |
| Range | 5941 |
| Interquartile range (IQR) | 2965.5 |
Descriptive statistics
| Standard deviation | 1722.390705 |
|---|---|
| Coefficient of variation (CV) | 0.1125769272 |
| Kurtosis | -1.195793327 |
| Mean | 15299.67772 |
| Median Absolute Deviation (MAD) | 1483.5 |
| Skewness | 0.0009180495309 |
| Sum | 66890191 |
| Variance | 2966629.742 |
| Monotocity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 12346 | 1 | < 0.1% | |
| 16282 | 1 | < 0.1% | |
| 16295 | 1 | < 0.1% | |
| 16293 | 1 | < 0.1% | |
| 16292 | 1 | < 0.1% | |
| 16287 | 1 | < 0.1% | |
| 16284 | 1 | < 0.1% | |
| 16283 | 1 | < 0.1% | |
| 16281 | 1 | < 0.1% | |
| 16222 | 1 | < 0.1% | |
| Other values (4362) | 4362 | 99.8% |
| Value | Count | Frequency (%) | |
| 12346 | 1 | < 0.1% | |
| 12347 | 1 | < 0.1% | |
| 12348 | 1 | < 0.1% | |
| 12349 | 1 | < 0.1% | |
| 12350 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 18287 | 1 | < 0.1% | |
| 18283 | 1 | < 0.1% | |
| 18282 | 1 | < 0.1% | |
| 18281 | 1 | < 0.1% | |
| 18280 | 1 | < 0.1% |
| Distinct | 931 |
|---|---|
| Distinct (%) | 21.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 171.3076441 |
|---|---|
| Minimum | -1192.2 |
| Maximum | 194353 |
| Zeros | 3423 |
| Zeros (%) | 78.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -1192.2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 571.621 |
| Maximum | 194353 |
| Range | 195545.2 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3020.678231 |
|---|---|
| Coefficient of variation (CV) | 17.63306154 |
| Kurtosis | 3911.443308 |
| Mean | 171.3076441 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 61.08787263 |
| Sum | 748957.02 |
| Variance | 9124496.977 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3423 | 78.3% | |
| 326.4 | 3 | 0.1% | |
| 510 | 2 | < 0.1% | |
| -1.25 | 2 | < 0.1% | |
| 112.6 | 2 | < 0.1% | |
| 254.4 | 2 | < 0.1% | |
| -17 | 2 | < 0.1% | |
| -2.55 | 2 | < 0.1% | |
| 102 | 2 | < 0.1% | |
| 156.65 | 2 | < 0.1% | |
| Other values (921) | 930 | 21.3% |
| Value | Count | Frequency (%) | |
| -1192.2 | 1 | < 0.1% | |
| -583.68 | 1 | < 0.1% | |
| -295.09 | 1 | < 0.1% | |
| -238.2 | 1 | < 0.1% | |
| -227.44 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 194353 | 1 | < 0.1% | |
| 27834.61 | 1 | < 0.1% | |
| 19950.66 | 1 | < 0.1% | |
| 13112.52 | 1 | < 0.1% | |
| 8591.88 | 1 | < 0.1% |
| Distinct | 772 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 128.0878911 |
|---|---|
| Minimum | -1241.43 |
| Maximum | 84925.88 |
| Zeros | 3591 |
| Zeros (%) | 82.1% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -1241.43 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 538.3635 |
| Maximum | 84925.88 |
| Range | 86167.31 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1483.319569 |
|---|---|
| Coefficient of variation (CV) | 11.5804824 |
| Kurtosis | 2489.296707 |
| Mean | 128.0878911 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 45.79007171 |
| Sum | 560000.26 |
| Variance | 2200236.944 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3591 | 82.1% | |
| 165 | 3 | 0.1% | |
| -2.95 | 2 | < 0.1% | |
| 556.46 | 2 | < 0.1% | |
| -5.1 | 2 | < 0.1% | |
| -30.6 | 2 | < 0.1% | |
| 681.05 | 2 | < 0.1% | |
| 179.1 | 2 | < 0.1% | |
| 69.6 | 2 | < 0.1% | |
| -32.85 | 2 | < 0.1% | |
| Other values (762) | 762 | 17.4% |
| Value | Count | Frequency (%) | |
| -1241.43 | 1 | < 0.1% | |
| -1126 | 1 | < 0.1% | |
| -855.76 | 1 | < 0.1% | |
| -419.4 | 1 | < 0.1% | |
| -197.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 84925.88 | 1 | < 0.1% | |
| 26476.68 | 1 | < 0.1% | |
| 22998.4 | 1 | < 0.1% | |
| 18620.2 | 1 | < 0.1% | |
| 16774.72 | 1 | < 0.1% |
| Distinct | 784 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 113.9210087 |
|---|---|
| Minimum | -1132.08 |
| Maximum | 61516.5 |
| Zeros | 3574 |
| Zeros (%) | 81.7% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -1132.08 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 510.013 |
| Maximum | 61516.5 |
| Range | 62648.58 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1084.902947 |
|---|---|
| Coefficient of variation (CV) | 9.523291264 |
| Kurtosis | 2400.912897 |
| Mean | 113.9210087 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 44.60023013 |
| Sum | 498062.65 |
| Variance | 1177014.404 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3574 | 81.7% | |
| 165 | 4 | 0.1% | |
| 102.9 | 2 | < 0.1% | |
| 209.25 | 2 | < 0.1% | |
| 303 | 2 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| -2.95 | 2 | < 0.1% | |
| 337.2 | 2 | < 0.1% | |
| 142.4 | 2 | < 0.1% | |
| 299.75 | 2 | < 0.1% | |
| Other values (774) | 778 | 17.8% |
| Value | Count | Frequency (%) | |
| -1132.08 | 1 | < 0.1% | |
| -331.5 | 1 | < 0.1% | |
| -186.35 | 1 | < 0.1% | |
| -152.64 | 1 | < 0.1% | |
| -102.58 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 61516.5 | 1 | < 0.1% | |
| 22752.46 | 1 | < 0.1% | |
| 14022.92 | 1 | < 0.1% | |
| 10535.48 | 1 | < 0.1% | |
| 7709.59 | 1 | < 0.1% |
| Distinct | 1004 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 156.2824977 |
|---|---|
| Minimum | -555.9 |
| Maximum | 103302.47 |
| Zeros | 3354 |
| Zeros (%) | 76.7% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -555.9 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 626.4975 |
| Maximum | 103302.47 |
| Range | 103858.37 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1694.180808 |
|---|---|
| Coefficient of variation (CV) | 10.84050251 |
| Kurtosis | 3154.731748 |
| Mean | 156.2824977 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 52.70101885 |
| Sum | 683267.08 |
| Variance | 2870248.611 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3354 | 76.7% | |
| 600 | 2 | < 0.1% | |
| -2.95 | 2 | < 0.1% | |
| 251.52 | 2 | < 0.1% | |
| 374.64 | 2 | < 0.1% | |
| 183.6 | 2 | < 0.1% | |
| 289 | 2 | < 0.1% | |
| 251.56 | 2 | < 0.1% | |
| 141.41 | 2 | < 0.1% | |
| 307.5 | 2 | < 0.1% | |
| Other values (994) | 1000 | 22.9% |
| Value | Count | Frequency (%) | |
| -555.9 | 1 | < 0.1% | |
| -195.5 | 1 | < 0.1% | |
| -76.3 | 1 | < 0.1% | |
| -60.35 | 1 | < 0.1% | |
| -53.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 103302.47 | 1 | < 0.1% | |
| 21462.4 | 1 | < 0.1% | |
| 16558.14 | 1 | < 0.1% | |
| 13500.5 | 1 | < 0.1% | |
| 12992.4 | 1 | < 0.1% |
| Distinct | 885 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 112.8104119 |
|---|---|
| Minimum | -1591.2 |
| Maximum | 67159.27 |
| Zeros | 3474 |
| Zeros (%) | 79.5% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -1591.2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 512.7645 |
| Maximum | 67159.27 |
| Range | 68750.47 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1078.767912 |
|---|---|
| Coefficient of variation (CV) | 9.562662643 |
| Kurtosis | 3417.768755 |
| Mean | 112.8104119 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 55.3589938 |
| Sum | 493207.121 |
| Variance | 1163740.208 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3474 | 79.5% | |
| 300.25 | 2 | < 0.1% | |
| -7.95 | 2 | < 0.1% | |
| 169.5 | 2 | < 0.1% | |
| -4.95 | 2 | < 0.1% | |
| 816 | 2 | < 0.1% | |
| 642 | 2 | < 0.1% | |
| -12.75 | 2 | < 0.1% | |
| 244.5 | 2 | < 0.1% | |
| -29.85 | 2 | < 0.1% | |
| Other values (875) | 880 | 20.1% |
| Value | Count | Frequency (%) | |
| -1591.2 | 1 | < 0.1% | |
| -1462.5 | 1 | < 0.1% | |
| -155.52 | 1 | < 0.1% | |
| -143.7 | 1 | < 0.1% | |
| -131.41 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 67159.27 | 1 | < 0.1% | |
| 9656.85 | 1 | < 0.1% | |
| 7325.84 | 1 | < 0.1% | |
| 6367.2 | 1 | < 0.1% | |
| 4572.32 | 1 | < 0.1% |
| Distinct | 1063 |
|---|---|
| Distinct (%) | 24.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 165.446823 |
|---|---|
| Minimum | -3585.84 |
| Maximum | 75082.43 |
| Zeros | 3294 |
| Zeros (%) | 75.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -3585.84 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 686.5395 |
| Maximum | 75082.43 |
| Range | 78668.27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1361.078888 |
|---|---|
| Coefficient of variation (CV) | 8.22668495 |
| Kurtosis | 2159.444909 |
| Mean | 165.446823 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 41.76877766 |
| Sum | 723333.51 |
| Variance | 1852535.741 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3294 | 75.3% | |
| -7.5 | 3 | 0.1% | |
| 185.22 | 2 | < 0.1% | |
| 214.5 | 2 | < 0.1% | |
| 251.02 | 2 | < 0.1% | |
| -9.9 | 2 | < 0.1% | |
| 1032.46 | 2 | < 0.1% | |
| 281.66 | 2 | < 0.1% | |
| 104 | 2 | < 0.1% | |
| 179 | 2 | < 0.1% | |
| Other values (1053) | 1059 | 24.2% |
| Value | Count | Frequency (%) | |
| -3585.84 | 1 | < 0.1% | |
| -262.8 | 1 | < 0.1% | |
| -103.3 | 1 | < 0.1% | |
| -41.3 | 1 | < 0.1% | |
| -27 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 75082.43 | 1 | < 0.1% | |
| 28408.14 | 1 | < 0.1% | |
| 18165.74 | 1 | < 0.1% | |
| 18025.68 | 1 | < 0.1% | |
| 12691.16 | 1 | < 0.1% |
| Distinct | 1036 |
|---|---|
| Distinct (%) | 23.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 158.0793962 |
|---|---|
| Minimum | -608.84 |
| Maximum | 83109.96 |
| Zeros | 3323 |
| Zeros (%) | 76.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -608.84 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 591.378 |
| Maximum | 83109.96 |
| Range | 83718.8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1588.492908 |
|---|---|
| Coefficient of variation (CV) | 10.04870303 |
| Kurtosis | 1842.541565 |
| Mean | 158.0793962 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 39.09144293 |
| Sum | 691123.12 |
| Variance | 2523309.718 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3323 | 76.0% | |
| 135 | 2 | < 0.1% | |
| 134.1 | 2 | < 0.1% | |
| 105 | 2 | < 0.1% | |
| 35.4 | 2 | < 0.1% | |
| 302.72 | 2 | < 0.1% | |
| 472.01 | 2 | < 0.1% | |
| 181.3 | 2 | < 0.1% | |
| 500 | 2 | < 0.1% | |
| 158.85 | 2 | < 0.1% | |
| Other values (1026) | 1031 | 23.6% |
| Value | Count | Frequency (%) | |
| -608.84 | 1 | < 0.1% | |
| -330.12 | 1 | < 0.1% | |
| -209.5 | 1 | < 0.1% | |
| -195 | 1 | < 0.1% | |
| -167 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 83109.96 | 1 | < 0.1% | |
| 41959.44 | 1 | < 0.1% | |
| 25288.99 | 1 | < 0.1% | |
| 23426.81 | 1 | < 0.1% | |
| 20427.98 | 1 | < 0.1% |
| Distinct | 976 |
|---|---|
| Distinct (%) | 22.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 155.8325963 |
|---|---|
| Minimum | -4287.63 |
| Maximum | 107061.63 |
| Zeros | 3381 |
| Zeros (%) | 77.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -4287.63 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 637.343 |
| Maximum | 107061.63 |
| Range | 111349.26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1771.125212 |
|---|---|
| Coefficient of variation (CV) | 11.36556314 |
| Kurtosis | 3054.471917 |
| Mean | 155.8325963 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 51.71521442 |
| Sum | 681300.111 |
| Variance | 3136884.517 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3381 | 77.3% | |
| -8.5 | 3 | 0.1% | |
| 346.38 | 2 | < 0.1% | |
| 360.54 | 2 | < 0.1% | |
| 319.85 | 2 | < 0.1% | |
| 110.14 | 2 | < 0.1% | |
| 229.14 | 2 | < 0.1% | |
| 130.2 | 2 | < 0.1% | |
| 312.9 | 2 | < 0.1% | |
| 371.24 | 2 | < 0.1% | |
| Other values (966) | 972 | 22.2% |
| Value | Count | Frequency (%) | |
| -4287.63 | 1 | < 0.1% | |
| -1592.49 | 1 | < 0.1% | |
| -1000.37 | 1 | < 0.1% | |
| -717.23 | 1 | < 0.1% | |
| -611.86 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 107061.63 | 1 | < 0.1% | |
| 26464.99 | 1 | < 0.1% | |
| 19889.16 | 1 | < 0.1% | |
| 13445.33 | 1 | < 0.1% | |
| 11590.58 | 1 | < 0.1% |
| Distinct | 966 |
|---|---|
| Distinct (%) | 22.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 156.1483326 |
|---|---|
| Minimum | -485.14 |
| Maximum | 66312.51 |
| Zeros | 3391 |
| Zeros (%) | 77.6% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -485.14 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 645.8615 |
| Maximum | 66312.51 |
| Range | 66797.65 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1352.922537 |
|---|---|
| Coefficient of variation (CV) | 8.66434188 |
| Kurtosis | 1508.356844 |
| Mean | 156.1483326 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 35.03313489 |
| Sum | 682680.51 |
| Variance | 1830399.392 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3391 | 77.6% | |
| -15 | 3 | 0.1% | |
| 312.14 | 2 | < 0.1% | |
| 301.92 | 2 | < 0.1% | |
| 301.7 | 2 | < 0.1% | |
| -17 | 2 | < 0.1% | |
| -9.9 | 2 | < 0.1% | |
| 115.5 | 2 | < 0.1% | |
| 76.32 | 2 | < 0.1% | |
| 304.2 | 2 | < 0.1% | |
| Other values (956) | 962 | 22.0% |
| Value | Count | Frequency (%) | |
| -485.14 | 1 | < 0.1% | |
| -344.94 | 1 | < 0.1% | |
| -220.47 | 1 | < 0.1% | |
| -134.55 | 1 | < 0.1% | |
| -125 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 66312.51 | 1 | < 0.1% | |
| 39655.81 | 1 | < 0.1% | |
| 21880.44 | 1 | < 0.1% | |
| 21149.05 | 1 | < 0.1% | |
| 15952.38 | 1 | < 0.1% |
| Distinct | 1288 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 233.2313866 |
|---|---|
| Minimum | -561.6 |
| Maximum | 88588.23 |
| Zeros | 3073 |
| Zeros (%) | 70.3% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -561.6 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 153.045 |
| 95-th percentile | 829.6875 |
| Maximum | 88588.23 |
| Range | 89149.83 |
| Interquartile range (IQR) | 153.045 |
Descriptive statistics
| Standard deviation | 2015.042361 |
|---|---|
| Coefficient of variation (CV) | 8.639670631 |
| Kurtosis | 1272.14234 |
| Mean | 233.2313866 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 33.35022294 |
| Sum | 1019687.622 |
| Variance | 4060395.715 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3073 | 70.3% | |
| -1.65 | 3 | 0.1% | |
| 1304.04 | 2 | < 0.1% | |
| -4.95 | 2 | < 0.1% | |
| 118.8 | 2 | < 0.1% | |
| 115.2 | 2 | < 0.1% | |
| 75 | 2 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 331.14 | 2 | < 0.1% | |
| 228.96 | 2 | < 0.1% | |
| Other values (1278) | 1280 | 29.3% |
| Value | Count | Frequency (%) | |
| -561.6 | 1 | < 0.1% | |
| -213.68 | 1 | < 0.1% | |
| -133.15 | 1 | < 0.1% | |
| -115.43 | 1 | < 0.1% | |
| -93.18 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 88588.23 | 1 | < 0.1% | |
| 70246.5 | 1 | < 0.1% | |
| 49622.18 | 1 | < 0.1% | |
| 26750.7 | 1 | < 0.1% | |
| 21012.17 | 1 | < 0.1% |
| Distinct | 1400 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 244.9004277 |
|---|---|
| Minimum | -983.87 |
| Maximum | 96099.63 |
| Zeros | 2952 |
| Zeros (%) | 67.5% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -983.87 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 188.5425 |
| 95-th percentile | 897.9025 |
| Maximum | 96099.63 |
| Range | 97083.5 |
| Interquartile range (IQR) | 188.5425 |
Descriptive statistics
| Standard deviation | 1919.974088 |
|---|---|
| Coefficient of variation (CV) | 7.839815169 |
| Kurtosis | 1596.73994 |
| Mean | 244.9004277 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 36.05642898 |
| Sum | 1070704.67 |
| Variance | 3686300.499 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 2952 | 67.5% | |
| -3.75 | 3 | 0.1% | |
| 118.8 | 3 | 0.1% | |
| -15 | 3 | 0.1% | |
| 180 | 3 | 0.1% | |
| 158.4 | 2 | < 0.1% | |
| 300.76 | 2 | < 0.1% | |
| 313.4 | 2 | < 0.1% | |
| 616.8 | 2 | < 0.1% | |
| 240.55 | 2 | < 0.1% | |
| Other values (1390) | 1398 | 32.0% |
| Value | Count | Frequency (%) | |
| -983.87 | 1 | < 0.1% | |
| -788.38 | 1 | < 0.1% | |
| -468.32 | 1 | < 0.1% | |
| -442.89 | 1 | < 0.1% | |
| -134.96 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 96099.63 | 1 | < 0.1% | |
| 52681.27 | 1 | < 0.1% | |
| 39995.95 | 1 | < 0.1% | |
| 19180.9 | 1 | < 0.1% | |
| 17433.29 | 1 | < 0.1% |
| Distinct | 1672 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 334.3449794 |
|---|---|
| Minimum | -295.73 |
| Maximum | 329494.22 |
| Zeros | 2666 |
| Zeros (%) | 61.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -295.73 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 280.515 |
| 95-th percentile | 1013.73 |
| Maximum | 329494.22 |
| Range | 329789.95 |
| Interquartile range (IQR) | 280.515 |
Descriptive statistics
| Standard deviation | 5087.51706 |
|---|---|
| Coefficient of variation (CV) | 15.21637044 |
| Kurtosis | 4011.982885 |
| Mean | 334.3449794 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 62.12608601 |
| Sum | 1461756.25 |
| Variance | 25882829.84 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 2666 | 61.0% | |
| 178.26 | 3 | 0.1% | |
| 244.86 | 2 | < 0.1% | |
| 292.34 | 2 | < 0.1% | |
| -12.75 | 2 | < 0.1% | |
| 486.96 | 2 | < 0.1% | |
| 97.5 | 2 | < 0.1% | |
| 426.37 | 2 | < 0.1% | |
| 133.68 | 2 | < 0.1% | |
| 114.6 | 2 | < 0.1% | |
| Other values (1662) | 1687 | 38.6% |
| Value | Count | Frequency (%) | |
| -295.73 | 1 | < 0.1% | |
| -207.22 | 1 | < 0.1% | |
| -147.87 | 1 | < 0.1% | |
| -136.85 | 1 | < 0.1% | |
| -133 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 329494.22 | 1 | < 0.1% | |
| 27837.45 | 1 | < 0.1% | |
| 25375.41 | 1 | < 0.1% | |
| 24434.45 | 1 | < 0.1% | |
| 22536.21 | 1 | < 0.1% |
| Distinct | 674 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.19213403 |
|---|---|
| Minimum | -436.2 |
| Maximum | 91161.63 |
| Zeros | 3689 |
| Zeros (%) | 84.4% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -436.2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 406.987 |
| Maximum | 91161.63 |
| Range | 91597.83 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1452.168439 |
|---|---|
| Coefficient of variation (CV) | 14.63995561 |
| Kurtosis | 3543.071957 |
| Mean | 99.19213403 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 56.96227946 |
| Sum | 433668.01 |
| Variance | 2108793.176 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3689 | 84.4% | |
| -8.3 | 3 | 0.1% | |
| 232 | 2 | < 0.1% | |
| 442.9 | 2 | < 0.1% | |
| 215.25 | 2 | < 0.1% | |
| 264.91 | 2 | < 0.1% | |
| 128.76 | 2 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| -1.65 | 2 | < 0.1% | |
| 164.68 | 2 | < 0.1% | |
| Other values (664) | 664 | 15.2% |
| Value | Count | Frequency (%) | |
| -436.2 | 1 | < 0.1% | |
| -381.95 | 1 | < 0.1% | |
| -336 | 1 | < 0.1% | |
| -172.89 | 1 | < 0.1% | |
| -125 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 91161.63 | 1 | < 0.1% | |
| 12393.7 | 1 | < 0.1% | |
| 11728.02 | 1 | < 0.1% | |
| 11485.54 | 1 | < 0.1% | |
| 7835.54 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| CustomerID | 201012 | 201101 | 201102 | 201103 | 201104 | 201105 | 201106 | 201107 | 201108 | 201109 | 201110 | 201111 | 201112 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 12346 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 1 | 12347 | 711.79 | 475.39 | 0.0 | 0.00 | 636.25 | 0.0 | 382.52 | 0.0 | 584.91 | 0.0 | 1294.32 | 0.00 | 224.82 |
| 2 | 12348 | 892.80 | 227.44 | 0.0 | 0.00 | 367.00 | 0.0 | 0.00 | 0.0 | 0.00 | 310.0 | 0.00 | 0.00 | 0.00 |
| 3 | 12349 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 1757.55 | 0.00 |
| 4 | 12350 | 0.00 | 0.00 | 334.4 | 0.00 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 5 | 12352 | 0.00 | 0.00 | 296.5 | 304.68 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 632.5 | 0.00 | 311.73 | 0.00 |
| 6 | 12353 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 89.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 7 | 12354 | 0.00 | 0.00 | 0.0 | 0.00 | 1079.40 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 8 | 12355 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 459.4 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 9 | 12356 | 0.00 | 2271.62 | 0.0 | 0.00 | 481.46 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.00 | 58.35 | 0.00 |
Last rows
| CustomerID | 201012 | 201101 | 201102 | 201103 | 201104 | 201105 | 201106 | 201107 | 201108 | 201109 | 201110 | 201111 | 201112 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4362 | 18273 | 0.0 | 0.00 | 0.0 | 51.0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 102.0 | 0.00 | 0.00 | 51.00 |
| 4363 | 18274 | 0.0 | 0.00 | 0.0 | 0.0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 4364 | 18276 | 0.0 | 0.00 | 0.0 | 0.0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.0 | 335.86 | -12.50 | 0.00 |
| 4365 | 18277 | 0.0 | -12.75 | 0.0 | 0.0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.0 | 110.38 | 0.00 | 0.00 |
| 4366 | 18278 | 0.0 | 0.00 | 0.0 | 0.0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 173.9 | 0.00 | 0.00 | 0.00 |
| 4367 | 18280 | 0.0 | 0.00 | 0.0 | 180.6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 4368 | 18281 | 0.0 | 0.00 | 0.0 | 0.0 | 0.00 | 0.00 | 80.82 | 0.00 | 0.00 | 0.0 | 0.00 | 0.00 | 0.00 |
| 4369 | 18282 | 0.0 | 0.00 | 0.0 | 0.0 | 0.00 | 0.00 | 0.00 | 0.00 | 98.76 | 0.0 | 0.00 | 0.00 | 77.84 |
| 4370 | 18283 | 0.0 | 215.00 | 102.9 | 0.0 | 117.68 | 99.47 | 307.53 | 143.19 | 0.00 | 134.9 | 114.65 | 651.56 | 208.00 |
| 4371 | 18287 | 0.0 | 0.00 | 0.0 | 0.0 | 0.00 | 765.28 | 0.00 | 0.00 | 0.00 | 0.0 | 1072.00 | 0.00 | 0.00 |